
/*


	Information on the data, programs, figures and tables of the paper "National Institutions and Self-Insurance" by Raphael Godefroy and Joshua Lewis




1. Data

1.1 [Public] GAEZ datasets on potential productivity by crop category, crop failure by year and by crop category, cultivation by crop category + FAO crop prices + Distance to border + Country 

File: public_data/gaez_africa.dta

Source: https://gaez.fao.org/ and https://un-fao-gaez-4-1-nycgov.hub.arcgis.com/ and ArcGIS

One observation per latitude-longitude point


Variables: point_x (longitude), point_y (latitude), CROP_caparhigh (potential yield high capacity rainfed), CROP_1986 (potential yield in year 1986), ... , CROP_2000 (potential yield in year 2000), CROP CATEGORY_areat (area cultivated), CROP CATEGORY_prdnt (production), ncropswall (total number of crops cultivated, including crops outside the selected set of crops), careawall (total area cultivated, including crops outside the selected set of crops), country, adist (distance to border), border_id (closest border), CROP_priceusa (prices for the USA in 2000), CROP_priceother (prices for France in 2000)



Remark A. Land productivity (e.g. potential yield or probability of crop failure) is observed at the crop level, but area cultivated is observed only at the level of aggregated crop categories.
Because some estimations require mapping land productivity to area cultivated, we approximate the productivity of each crop category by the productivity of one representative crop, as follows:
Crop category used for area cultivated    < -- >  Crop used for productivity such as probability of crop failure
cassava or yam                            < -- >  cassava
cocoa, coffee or tea                      < -- >  cocoa
potato or sweet potato                    < -- >  sweet potato
banana cocoa coconut                      < -- >  coconut
vegetables                                < -- >  tomato
pulses                                    < -- >  phaseolus
sugar cane                                < -- >  sugar cane
sugar beet                                < -- >  sugar beet
olive                                     < -- >  olive
rice                                      < -- >  rice
millet                                    < -- >  millet
maize                                     < -- >  maize
groundnut                                 < -- >  groundnut
cotton                                    < -- >  cotton
rapeseed                                  < -- >  rapeseed
sunflower                                 < -- >  sunflower
soybean                                   < -- >  soybean
wheat                                     < -- >  wheat
other cereals                             < -- >  barley


This mapping is based on a visual inspection of the most widely grown crops in each crop category for which area cultivated is provided, and on the availability of information on crop failure.

Remark B. To make potential yield and production by crop comparable, we divide raw potential yield by the conversion factor given in Table A6-3 of the GAEZ user guide: https://www.gaez.iiasa.ac.at/docs/GAEZ_User_Guide.pdf
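
The conversion described in Remark B could be sketched in Stata as follows. This is only an illustration under stated assumptions: the variable name maize_caparhigh is a guess based on the CROP_caparhigh naming pattern, and conv_factor is a placeholder that has to be replaced with the value listed for the crop in Table A6-3. If the values in the file have already been converted, this step is not needed.

```stata
use "public_data/gaez_africa.dta", clear

* Assumption: maize_caparhigh follows the CROP_caparhigh naming pattern
* Assumption: 1 is a placeholder, not a real factor; use the Table A6-3 value for the crop
local conv_factor = 1

* Divide raw potential yield by the conversion factor
generate double maize_caparhigh_adj = maize_caparhigh / `conv_factor'
```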

Remark C. The source for crop prices in the USA and France in 2000 is https://www.fao.org/faostat/en/#data




1.2 [Public] Dataset of coordinates of ethnic homelands following Murdock 

Source: Murdock atlas and ArcGIS, provided by Stelios Michalopoulos and Lucienne Talba

File: public_data/distance_murdock_all.dta

One observation per latitude-longitude point

Variables: point_x (longitude), point_y (latitude), ethnic_name (name of ethnic group), border_id (closest border), country



1.3 [Private] World Bank dataset on Rule of Law index

Source: https://info.worldbank.org/governance/wgi/

File: private_data/wgidataset.dta

One observation per country

Variables: country, rle (estimate for 1996)

 

1.4 [Public] Dataset of population and closest city (>100000 inhabitants) 

Source: ArcGIS and GAEZ

File: public_data/urbanizationbypoint.dta

One observation per latitude-longitude point [this dataset is used only in robustness checks and contains only the points used in the main regressions]

Variables: point_x, point_y, distcity, pop00



1.5 [Private] Demographic and Health Surveys household datasets for the following countries

Source: https://www.idhsdata.org/idhs/ and ArcGIS to obtain the point_x and point_y coordinates of each dhsid

Folder: private_data/

One observation per household

Variables: point_x, point_y, dhsid, sample, hhid, country, livestockyn, fridgehh, mobphone, electrchh, bankacc, aglandyn, hhrelate




2. Programs


All the programs are in the folder programs

The program edcc_alltables launches all the other programs

Each program file indicates which figure or table of the working paper it produces
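
As a minimal sketch, the full set of programs might be launched from Stata as shown below. The .do extension, the relative path, and the working directory are assumptions not stated above; the path in the cd command is a placeholder for the root folder of the replication package. The private datasets (1.3 and 1.5) would presumably need to be obtained from their sources and placed in private_data beforehand.

```stata
* Assumption: replace this placeholder with the root folder of the package
cd "path/to/replication_package"

* Assumption: the master program is saved as programs/edcc_alltables.do
do "programs/edcc_alltables.do"
```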


Remark: above almost every estimation command, the corresponding OLS regression with standard errors clustered at the border level is written after a star (*), so it is commented out and will not run. We found it convenient to keep these commands there for comparison.
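
The commenting convention in the remark above might look like the following sketch. The data are simulated and the variable names y and x are hypothetical; only border_id appears in the data description. The uncommented regress line is a stand-in for the actual estimation command, which is not specified here.

```stata
* Simulated data for illustration only (not the paper's data)
clear
set seed 12345
set obs 200
generate border_id = ceil(_n / 20)
generate x = rnormal()
generate y = 0.5 * x + rnormal()

* Comparison OLS with border-clustered standard errors, disabled by the leading star:
* regress y x, vce(cluster border_id)

* Stand-in for the main estimation command
regress y x, vce(robust)
```

Removing the leading star from the comparison line would make Stata run that regression as well.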


3. Figures and tables


All the figures and tables are saved in the folder tables



























*/
